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Abstract 

In this paper, a compressive sensing (CS) approach is proposed for sparse binary signals’ com¬ 
pression and reconstruction based on analog fountain codes (AFCs). In the proposed scheme, referred 
to as the analog fountain compressive sensing (AFCS), each measurement is generated from a linear 
combination of L randomly selected binary signal elements with real weight coefficients. The weight 
coefficients are chosen from a finite weight set and L, called measurement degree, is obtained based on a 
predefined degree distribution function. We propose a simple verification based reconstruction algorithm 
for the AFCS in the noiseless case. The proposed verification based decoder is analyzed through SUM- 
OR tree analytical approach and an optimization problem is formulated to find the optimum measurement 
degree to minimize the number of measurements required for the reconstruction of binary sparse signals. 

We show that in the AFCS, the number of required measurements is of 0{—n\og{l — k/n)), where n is 
the signal length and k is the signal sparsity level. We then consider the signal reconstruction of AFCS 
in the presence of additive white Gaussian noise (AWGN) and the standard message passing decoder 
is then used for the signal recovery. Simulation results show that the AFCS can perfectly recover all 
non-zero elements of the sparse binary signal with a significantly reduced number of measurements, 
compared to the conventional binary CS and -minimization approaches in a wide range of signal to 
noise ratios (SNRs). Finally, we show a practical application of the AFCS for the sparse event detection 
in wireless sensor networks (WSNs), where the sensors’ readings can be treated as measurements from 
the CS point of view. 
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1. Introduction 

Compressive sensing (CS) has drawn a lot of interests in reeent years as a revolutionary 
signal sampling paradigm. The main idea behind CS is the possibility to reconstruet a sparse 
signal from an under-determined system of linear equations through a eomputationally effieient 
eonvex programming algorithm. Let us eonsider the reeonstruetion problem of a sparse signal 
X G M” from the linear measurements y G W^, where m < n, y = Gx, and G G is the 

measurement matrix, modeling the measurement (sensing) proeess. We assume that the signal x 
is /c-sparse, whieh means that exaetly k n elements of x are nonzero [[D, but we do not know 
the loeations of the nonzero entries of x. Due to the sparsity of x, one could try to compute x 
by solving the following optimization problem: 

min||x||o s.t. Gx = y. (1) 

X 

However, solving ([T]) is an NP-hard problem and thus practically not feasible O. Instead, a 
convex relaxation problem can be formulated as follows, which can be solved efficiently via 
linear or quadratic programming techniques. 

min||x||i s.t. Gx = y. (2) 

X 

As shown in [lH |3l IH [5]|, under certain conditions on the matrix G and the sparsity of x, both Q 
and Q have the same unique solution. The Restricted Isometry Property (RIP) and the coherence 
of matrix G are to date the most widely used tools to derive such conditions. A comprehensive 
introduction to compressive sensing can be found in excellent review papers O |7||. 

Unlike the conventional CS theory which considers only the sparsity of the signal, in [|8l a 
more realistic signal model and CS approach, referred to as the Model-based CS, was introduced 
by considering some structure among the signal coefficients. Model-based CS and its variations 
has been widely studied in the literature Il8l|9l[l0l[lll[l2l[l3l[l4l|. Due to the known structure of 
the sparse signal in model-based CS, signal models provide two major advantages to CS. First, 
the number of measurements required for the reconstruction of the sparse signal is significantly 
reduced. Second, a more accurate and robust recovery is achieved as we can better differentiate 
true signal information from recovery artifacts [[8]|. In this paper, we are particularly interested 
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in the reconstruction of sparse binary signals using linear measurements. Binary CS is a special 
case of model based CS which has many applications, such as event detection in wireless sensor 
networks (WSNs), group testing, and spectrum sensing for cognitive radios. More practical 
settings and similar discrete types of compressive sensing scenarios can be found in IfTSl [TO [TTl 
HSl SlEQlllIli. More specifically, in binary CS, the nonzero elements of the sparse signal have 
the same value of 1, and the objective is to minimize the number of real valued measurements 
linearly generated from the binary sparse signal. 

The -minimization solution for the noiseless binary CS has been proposed in liTTl . where 
problem in Q is solved by limiting the reconstructed signal in the range [0,1]. Also, by using 
an integer programming, [|T5l proposed a model to solve a system of linear equations with 
integer variables. In ll2TI a simple verification-based decoder has been proposed for binary CS 
problems, where the measurement matrix is sparse and its elements are randomly obtained based 
on a predefined probability distribution function. However, the number of required measurements 
in these approaches are still very large, as they do not take the binary structure of the signal 
into the design of the CS approaches. 

In this paper, we propose a binary CS approach based on analog fountain codes (AFCs) ^22^ . 
AFCs were originally designed to approach the capacity of the Gaussian channel across a wide 
range of signal to noise ratios (SNRs). Each AFC coded symbol is a linear combination of 
randomly selected information symbols with real weight coefficients. With the proper design of 
the weight coefficients, each binary linear equation associated with each AFC coded symbol will 
have a unique solution. As shown in the AFC decoding problem is equivalent to solving 
a system of binary linear equations at high SNRs, which is the same as the reconstruction of a 
binary sparse signal from noiseless linear measurements. This interesting property of AFC codes 
motivates us to incorporate them in the design of an efficient binary CS approach to minimize 
the number of measurements required for the successful recovery of the sparse signal. 

In the proposed analog fountain compressive sensing (AFCS) approach, each measurement is 
generated from a number of randomly selected binary information symbols with weight coeffi¬ 
cients selected from a finite weight set. Thanks to the random structure of AFC codes, the number 
of generated measurements is potentially limitless and the transmitter can keep transmitting 
the measurements till the destination fully recovers the sparse binary signal. As a result, the 
transmitter in the proposed binary CS can adapt to the time-varying channel condition and thus 
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can maximize the sensing and transmission effieieney. Due to the sparsity and the faet that weight 
eoeffieients do not introduee redundaney, eaeh measurement is eonneeted to a very small number 
of non-zero signal eomponents in the respeetive weighted bipartite graph; thus, information 
symbols ean be reeovered by eomparing the measurement value with the weight eoeffieients 
assigned to it in an iterative manner. More speeifieally, we propose a reeonstruetion algorithm, 
where eaeh measurement ean determine its eonneeted variable nodes if it has eonneetion with 
at most T nonzero signal elements in the bipartite graph, where T > 0 is a predefined value. 
To minimize the number of required measurements, we propose an optimization problem to find 
the optimum measurement degree, defined as the number of information symbols eonneeted to 
eaeh measurement or simply the row weight of the measurement matrix. The minimum number 
of required measurements is then approximated and shown to be of 0{—n\og{l — s)), where 
n is the signal length and s = k/n is the signal sparsity order. Simulation results show that the 
proposed seheme has a signifieantly lower error rate eompared to the £i-minimization approaeh 
ifTTll and the binary CS seheme in [|2T1. 

We further extend the proposed AFCS seheme to the noisy CS problem, where the mea¬ 
surements are reeeived in the presenee of additive white Gaussian noise (AWGN). We propose 
a message passing deeoder to reeonstruet the binary sparse signal and show that the AFCS 
ean reeover the sparse signal with a very small number of measurements eompared to the ii- 
minimization approaeh IfTTll . More speeifieally, the proposed AFCS seheme requires less number 
of measurements to eompletely reeover the binary sparse signal in a wide range of SNRs even 
with a lower measurement degree eompared to the £ 1 -minimization approaeh IfTTll . 

As a direet applieation of the AFCS, we then eonsider sparse event deteetion (SED) in WSNs, 
where a set of monitoring sensors try to eapture a small number of aetive events randomly 
distributed in the sensing field lf23l . CS meehanisms are benefieial to WSNs as sensor nodes 
usually have eonstrained sensing, limited memory and proeessing eapability, as well as small 
eommunieation bandwidth ||2^ . Several approaehes have used CS theory for SED problem, but 
they are impraetieal in some seenarios. Eor example, Bayesian eompressive sensing (BCS) has 
been used in ll25ll to solve the SED problem in WSNs. However, BCS is known to be sensitive to 
noise, leading to a eonsiderable degraded performanee in low SNRs. To solve this problem, ll26ll 
applied sequential eompressive sensing (SCS) to model the SED problem as a support reeovery 
problem and solve it in an adaptive manner. However, in this approaeh, a large number of sensor 
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nodes (about half that of event sourees) is required to be deployed in the sensing field in order 
to guarantee its performanee. This will also bring the system a signifieant eommunieation eost 
due to a large number of sensor nodes. 

In this paper, we revisit the sparse event deteetion problem in WSNs from the eoding theory 
perspeetive and solve the problem as a deeoding problem of AFCS. We formulate the variable 
and eheek node degree distributions for two seenarios of random and uniform deployment of 
sensor nodes. The minimum number of sensor nodes required to fully deteet all aetive events in 
the noiseless ease is then found as a funetion of the sensing eoverage radius and the total area of 
the sensing field. We show that the proposed AFCS seheme approaehes this lower bound in high 
SNRs. We further analyze the probability of false deteetion for the proposed seheme and show 
that the AFCS aehieves a very low probability of false deteetion at different SNRs. Simulation 
results show that the proposed seheme outperforms the SCS ll26l and BCS [[25ll approaehes in 
terms of the probability of eorreet deteetion at different SNRs and the minimum sampling ratio 
to suoeessfully deteet the aetive event sourees at different sampling ratios. 

Throughout the paper the following notations are used. We use a boldfaee small letter to 
denote a veetor and the entry of veetor v is denoted by Uj. A boldfaee eapital letter is used to 
denote a matrix and the element in the row of matrix G is denoted by gij. The transpose 
of matrix G is denoted by G'. A set is denoted by a eurly eapital letter and the eardinality of 
set S is represented by |iS|. 

The rest of the paper is organized as follows. In Seetion II, we introduee the AFCS approaeh 
and the general reeonstruetion algorithm for the AFCS seheme. The analysis of the proposed 
reeonstruetion algorithm in the noiseless ease is then presented in Seetion III, followed by 
the optimization of the measurement degree and an approximation for the number of required 
measurements. A message passing deeoder is then proposed for AFCS in the presenee of additive 
white Gaussian noise in Seetion IV. In Seetion V, we demonstrate the applieation of AFCS in 
WSNs for sparse events deteetion. Seetion VII provides some diseussion on the praetieal issues 
for sparse event deteetion in WSNs. Simulation results are shown in Seetion VI. Finally Seetion 
VII eoneludes the paper. 
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TABLE I 

Notation Summary 


Notatioi 

1 Description 

n 

Length of the binary sparse signal (number of event sources) 

m 

Number of measurements (sensor nodes) 

k 

Number of nonzero elements of the binary sparse signal 

(number of active events) 

s 

Sparsity order, s = k/n 

/3 

Sampling ratio, 0 = m/n 

L 

Measurement (check node) degree 

dy 

Variable node degree 

b 

binary sparse signal of dimension n 

Gtti, X n 

m X n measurement matrix, G = [gij] 

w 

Weight set of AFCS with size D 

T 

Maximum number of weight coefficients included in each 

operation of the decoding apgorithms 

S 

Monitoring region of ai'ea S 

e 

binary spai'se event signal of dimension n 

Hm X n 

m X n channel matrix, H = [hij] 

hij 

The channel between the sensor node and the event 

source 

dij 

The distance between the sensor node and the event 

source 

a 

Propagation loss factor 


Thermal measurement noise at the sensor 

A 

Received SNR at the sink node 

Rs 

Sensor coverage radius 

PCD 

Probability of connect detection 

PFD 

Probability of false detecttion 


II. Analog Fountain Compressive Sensing 


We assume that a binary signal veetor of length n, b, is strietly fc-sparse, whieh means that 
the number of non-zero elements of b is exaetly k. The sparsity order of b is then defined as 
s = k/n. Table 1^ provides a summary of notations for quiek referenee. 

Let us first briefly introduee the analog fountain eode (AFC). AFCs were originally proposed 
in [l22l 123 as an effeetive adaptive transmission seheme to approaeh the eapaeity of wireless 
ehannels. In AFC, to generate eaeh eoded symbol L information symbols are seleeted uniformly 
at random and then linearly eombined with real weight eoeffieients. For simplieity, we assume 
that weight eoeffieients are ehosen from a finite weight set, W, with D > L positive real 
members. 


























7 


Let G denote the generator matrix of dimension m x n, where m is the number of eoded 
symbols, then AFC eoded symbols, c, are generated as follows: 

c = Gb, (3) 

where, only L elements in eaeh row of matrix G are nonzero and eaeh nonzero element of G is 
randomly ehosen from the weight set W. As shown in ll23 . to improve AFC eode performanee, 
the generator matrix should have uniform column degrees, which can be simply achieved by 
selecting information symbols from those which have currently smallest degrees when generating 
each coded symbol. This way, each information symbol is connected to a fixed number of coded 
symbols; leading to a regular matrix G. 

A. Measurement Matrix Construction 

Each row of the measurement matrix is generated in a similar way that a coded symbol is 
generated in AFCs, that is first an integer L, called measurement degree, is obtained based on a 
predefined probability mass function, called degree distribution. For simplicity, we assume that 
the measurement degree, i.e., the number of non-zero elements in each row of the measurement 
matrix, is a fixed value. Let gj denote the row of the measurement matrix G; then, only 
L entries of gj have nonzero values. The value of nonzero elements of gj are also randomly 
obtained from the weight set W without the repetition. Although the weight coefficients can be 
obtained based on a predefined probability distribution function, for simplicity we assume that 
the weight set W has D > L positive real members as follows: 

W = {wi,W2, ...,Wd}- 

In the next subsection, we show how the weight set W should be defined. The measurements 
are then calculated by c = Gb, which can be graphically represented by a weighted bipartite 
graph with elements of the binary signal b and the measurements as variable and check nodes, 
respectively. In the bipartite graph, the variable node j is connected to the check node i, if gij 
is non-zero, and the edge between them is also assigned with the value of gij. Fig. shows the 
weighted bipartite graph of the proposed binary CS scheme, where the measurement degree is 
L = 2. 

As we consider that the locations of non-zero elements in each row of the measurement matrix 
are selected quite randomly, it is highly probable that a variable node is not connected to any 
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Fig. 1. The weighted bipartite graph of the AFCS scheme. Black and white circles respectively show the nonzero and zero 
signal elements. 

check node in the corresponding bipartite graph, especially when the number of measurements 
is relatively low. More specifically, the probability that a variable node is not connected to any 
check node is [[28l . This problem significantly increases the error rate for the CS schemes 

in uniiiii. 

To overcome this problem, we modify the measurement matrix in a way that each variable 
node is connected to at least one check node. Moreover, to ensure that all the variable nodes 
have the same chance to be recovered, the degrees of all variable nodes are chosen to be the 
same. To achieve this, we select the variable nodes among those with the lowest degrees when 
generating each measurement. In this way, the variable node degrees will be either dy or dy — 1, 
where dy is the smallest integer larger than or equal to mL/n [|28l. Later in Section V, we show 
how this imposed structure in the measurement matrix can be obtained in real applications, 
which significantly increases the sensing efficiency and improve the reconstruction capability. 
It is important to note that the measurement matrix is not fixed and randomly generated for 
each experiment. In fact, the measurement matrix is generated in a way to have uniform column 
and row degrees, where the nonzero elements of the measurement matrix are selected from the 
predefined weight set. 

B. Weight Set Design 

It is important to note that the weight set should be designed such that each measurement 
can uniquely determine its connected variable nodes. For instance, let us assume that Wi + 
W 2 = tcs and a measurement has a value of Wj, and is connected to hi, 62 , and 63 with weight 
coefficients, wi, W 2 , and rcs, respectively. The decoder then try to solve the binary linear equation 
Wihi + tC 2&2 + Wi.h'i = W 3 . This measurement can determine either hi = 62 = 1,^3 = 0, or 
hi = h 2 = 0 ,63 = 1, i.e., two possible solutions. To guarantee that each measurement can 
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uniquely verify its eonneeted variable nodes, it is suffieient that the weight set W satisfies the 
following eondition: 


D 



(4) 


for all V G {—1, 0,1}^, where v ^ {0}^, and G W for i = 1, D. As stated in [|22l . each 
measurement is associated with a binary linear equation and thus the decoder task is actually 
to solve a system of binary linear equations in the noiseless case. Condition Q guarantees that 
the linear equation associated with each measurement has a unique solution. 

Here, we assume that the nonzero elements of the measurement matrix are sampled from a 
finite weight set, where the weight set satisfies Q. It is clear that when the members of the 
weight set are sampled from a continuous probability distribution function, Q can be easily 
satisfied. In practice, the elements of the measurement matrix depend on the physical condition 
of the system, which is mainly random. For example, as we will later see in Section V, in the 
sparse event detection in WSNs, the elements of the measurement matrix are channel gains, 
which are random due to shadowing and small scale fading; thus, condition Q is satisfied and 
each measurement can uniquely determine its connected variable nodes. In the rest of the paper, 
we use weight coefficients which have been obtained from a zero mean Gaussian distribution 
with variance 1 , unless otherwise specified. 

It is important to note that in conventional CS theory, restricted isometry property (RIP) is 
sufficient to guarantee sparse reconstruction by -minimization approach [|2l|. It is also known 
that £i-minimization reconstructs every sparse signal precisely when the sensing matrix satisfies 
the null space property (NSP) 0. In general, the null space of measurement matrix G is defined 
as A/'(G) = {z|Gz = 0}. If we want to recover all sparse signals x from the measurements Gx, 
it is clear that for any pair of distinct vectors x and x', we must have Gx ^ Gx', since otherwise 
it would be impossible to distinguish x from x'. In other words, to recover all sparse signals 
from the measurements generated by matrix G, we must have A/'(G) = {0}, where 0 is the zero 
vector of dimension n. It is worth mentioning that if the weight coefficients in AFCS satisfy 
condition Q, then each sparse binary signal is uniquely recovered form its linear measurements. 
In other words, the binary structure of the sparse signal enables us to design the measurement 
matrix which can precisely satisfy the null sparse property (7V(G) = {0}) for all binary signals, 
which is sufficient for compressive sensing. 
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C. Verification Based Reconstruction 

It is clear from Q that a zero valued measurement can uniquely verify its connected variable 
nodes as zero signal elements. In fact, condition Q guarantees that a binary linear equation with 
weight coefficients from W has a unique solution of zero if and only if the binary variables are 
all zero [l29l l30l . Also due to signal sparsity, only few nonzero signal elements are connected 
to each measurement. Therefore, by comparing the value of a measurement and the sum of 
different numbers of its weight coefficients, we can uniquely determine the values of its connected 
variable nodes. Based on these observations, we then propose the following verification based 
reconstruction algorithm, referred to as the Sum Verification Decoder. 

Let us consider the binary compressive sensing problem c = Gb, where the binary sparse signal 
vector b„xi has to be reconstructed from linear measurements c^xi and a given measurement 
matrix G^xn- We use the notation A/i to denote the set of variable nodes connected to check 
node Cj, and use to denote the set of connected variable nodes to check node Cj, excluding 
the variable nodes in the set B. The operations of the Sum Verification Decoder can then be 
summarized as follows, for a given T > 0: 

Sum Verification Decoder: Construct the weighted bipartite graph from the measurement 
matrix G. Construct a matrix Gt, where its rows correspond to all binary vectors of dimension 
D with at most T nonzero elements. Construct a vector w = [wi,W 2 , ■■■iWd]'. Compute the 
vector u as follows: u = G^w. Initially, we assume that b is a zero vector of dimension n. 

1. Find a measurement Cj with a nonzero degree which has a value equal to one of the elements 
of vector u (if there is no such measurement, this decoding algorithm halts at this point 
and fails to recover all the variable nodes). Find the set of its connected variable nodes Mi. 

a) Find the number of elements in A/), i.e. \Mi\ = r, where Mi = {ji, ■■■fir}- Construct a 

matrix F, where its rows correspond to all binary vectors of dimension r with at most 
T nonzero elements. Construct a vector gi,j 2 , Compute the vector 

z as follows: z =rg-^\ Find ^ such that zi = Ci. 

b) Set 6^;, = for r = 1, 2,..., r. 

2. Remove all the edges connected to verified variable nodes bj^. 

3. Update the measurement values to c = c — Gb. 

4. Repeat (1), (2), and (3) until all bjs are determined 
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Fig. 2. The sum-verification reconstruction algorithm for L = 4 and t = 2. Black and white circles show the non-zero and 
zero signal elements, respectively. 

In each iteration of the proposed decoder, a measurement which has a connection to at most 
T nonzero variable nodes is found (Step 1). Since the weights are chosen from a fixed weight 
set, this can be achieved by simply comparing the value of the measurements with a table 
consisting of all the possible values of the summation of at most T weight coefficients. If 
such a measurement cannot be found, the decoder will be terminated and the variable nodes 
cannot be fully recovered. In the next step, the variable nodes that are connected to the selected 
measurement will be uniquely recovered (Step 1, (a) and (b)). Then, the edges connected to the 
recovered variable nodes are removed from the bipartite graph (Step 2). Also, the measurement 
values will be updated to cancel out the effect of the recovered variable nodes on the remaining 
measurements (Step 3). This algorithm is repeated until all the variable nodes are recovered. 

Since the signal is sparse, the probability that a measurement is connected to a large number 
of nonzero elements of the signal is very low. In other words, a majority of measurements are 
connected to a very small number of nonzero variable nodes. Therefore, it is reasonable to only 
consider small values of T, which also avoids huge complexity at the decoder. More specifically, 
in this paper we only consider T = 0, 1, and 2. Fig. shows the proposed reconstruction 
algorithm for the case that the measurement degree is L = 4 and T = 2. As can be seen in this 
figure, measurement c* = is connected to hi, h 2 , b^, and bn- 2 , and by comparing the 

measurement value with vector z, all connected variable nodes can be uniquely determined. 

III. Performance Analysis of the Verification Based Decoder 

In this section, the performance of the sum verification decoder is analyzed for different values 
of T. We then find the optimum measurement degree as a function of the sparsity order, s. An 
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Fig. 3. SUM-OR tree Gt when d and j indicate the number of edges connected to the specific node. Circles and squares show 
OR and SUM nodes, respectively. 

upper bound is also derived for the number of measurements required to suoeessfully reeover 
all variable nodes. 

A. Analysis of the Verification Probability 

Let Q denote the bipartite graph eorresponding to the eode at the reeeiver. We ehoose subgraph 
Qi as following. Choose an edge (u, w) uniformly at random from all edges. Call the variable 
node V the root of Qi. Subgraph is the graph indueed by v and all neighbors of v within 
distanee 2^ after removing the edge {v;w). As shown in m,Gi is a tree asymptotieally, so we 
refer to Gi as a SUM-OR tree, where SUM and OR nodes refer to eheek and variable nodes, 
respeetively. Fig. shows the SUM-OR tree of depth 2i, where the root of the tree is at depth 
0 and has ehildren at depth 1. More speeifieally, a node at depth t has ehildren at depth t + 1 
for f = 0,..., 2£ — 1 and nodes at depths 0,2,.., 2£ are referred to as OR nodes, while nodes at 
depths 1,3,..., 2£ — 1 are referred to as SUM nodes. 

In the following, we rephrase the proposed sum verifieation deeoder by using the SUM-OR 
tree. At every iteration of the algorithm, messages (0 or 1) are passed along the edges from 
eheek nodes to variable nodes, and then from variable nodes to eheek nodes. A variable node 
sends 0 to an adjaeent eheek node if and only if its value is not reeovered yet. Similarly, a eheek 
node sends 0 to an adjaeent variable node if and only if it is not able to reeover the value of 
the variable node. In other words, a variable node sends 1 to a neighboring eheek node if and 
only if it has reeeived at least one message with value 1 from its other neighboring eheek nodes. 
Also, a eheek node sends 0 to a neighboring variable node if and only if it is eonneeted to more 
than T unreeovered variable nodes. 

Sinee, a subgraph expanded from eaeh variable node is asymptotieally a tree, then the message 
exehange between variable and eheek nodes in the bipartite graph ean be seen as the message 
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exchange between SUM and OR nodes in the tree. More specifically, the messages from OR 
nodes at depth 2i are passed to SUM nodes at depth 2i — 1, which calculate new messages to 
forward to OR nodes at depth 2i — 2, corresponding to one iteration of the decoding algorithm. 
Therefore, if we map the first iteration of the decoding algorithm to the message exchange 
between OR and SUM nodes at depths 21 and 2^ — 1, respectively, then after i iterations, the 
root of the tree will receive messages from its children. The decoding error probability is then 
the probability that the root of the tree does not receive at least one message of value 1 from 
its children. The following lemma gives the decoding probability which is equivalent to the 
probability that the root of the SUM-OR tree does not receive a message of value 1 from its 
children. 

Lemma 1: We consider the binary AFCS, where the measurements are generated by the AFCS 
process. Let pt denote the probability that a variable node is recovered after i iterations of the sum 
verification decoder, where po = 0. Let and denote the probability that an unrecovered 
variable node in the iteration of the reconstruction algorithm is zero and one, respectively, 
where q^'^ = 1 — Then for a fixed measurement degree L, pi is calculated as follows: 


Pi = l-6{1- fi), 


(5) 


where 5(a:) = ^ • 6 jX\ 


d A 


i=0 j=0 


d\ f d — i 
j 


p-_i (gS(l -Pi-i)) 


d—i 





( 6 ) 


d = L — 1, Xi = q^j^^/qf \ A = min{T — l,d — ij, = 1 — and d(x) is the variable node 
degree distribution function with respect to the edges. 

The proof of this lemma is provided in Appendix]^ To complete the analysis, we only need 
to find S(x). As we assume that each measurement is of degree L, variable nodes are either of 
degree d^ or d^ — 1, where d^ is the smallest integer larger than or equal to (3L and [3 is the 
sampling ratio, defined as the ratio of the number of measurements and that of variable nodes. 
Let vi and V 2 denote the probability that a variable node is of degree d^ and d^ — l, respectively, 
then Vi + V 2 = l and 


Vidy + V2{dy - 1) = /3L, 


(7) 
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which results in ui = l — dy+13L and V 2 = d^— [3L. Thus, the variable node degree distribution, 
A(a;), ean be written as follows: 

A(a;) = {1 — dy + l3L)x'^'’ + {dy — {3L)x'^'’~^. 

Aeeordingly, ^(a;), ean be ealeulated as follows OTII : 

, A'(x) VidyX'^^~^ + V 2 {dy — l)x'^^~‘^ 

^ =- Tl -’ 

where A'(a;) is the derivative of A(a;) with respeet to x. 

It is important to note that if T = L, then aeeording to ([^ we have fi = 1, and aeeordingly 
Pi = 1. This means that eaeh eheek node ean uniquely determine its eonneeted variable nodes 
if the parameter T equals to the measurement degree in the sum verifieation deeoder. 

B. Optimization of the Measurement Degree 

Aeeording to Lemma we ean formulate an optimization problem to find the optimum 
measurement degree in order to minimize the number of measurements required for the sueeessful 
reeovery of all variable nodes. However, as ean be seen in this lemma, the verifieation probability 
is not a linear funetion of the measurement degree; thus, finding the optimum degree is not 
straightforward. To overeome this problem, we propose a suboptimum solution to find the 
measurement degree to maximize the number of reeovered variable nodes in eaeh iteration of 
the sum verifieation deeoder. More speeifieally, in the first iteration of the deeoding algorithm, 
we have po = 0, = 1 — k/n, and Aq = ^/(l — thus, we have: 



In other words, a measurement ean determine its eonneeted variable nodes if the number of 
nonzero variable nodes eonneeted to that measurement is at most T. Thus, @ ean be rewritten 
as follows: 



Let go = whieh equals to 1 — then the average number of variable nodes, R, whieh ean 
be verified by eaeh measurement, ean be ealeulated as follows: 

T 

j=0 ^ 


(10) 
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Fig. 4. The optimum measurement degree versus the sparsity order of the signal. Solid lines show the optimum values, and 
dashed line shows the approximation which has been obtained from fn). 

The following lemma gives the optimum measurement degree, Lopt, for a given T and s. 

Lemma 2: The optimal measurement degree, Lopt, to maximize the average number of variable 
nodes, whieh ean be verified by each measurement for a given T and sparsity order s in the 
sum verification decoder, can be approximated as follows: 


T 


-'opt 


21 og(l - s) 


I 


( 11 ) 


where [.] is the ceil operator. 

The proof of this lemma is provided in Appendix From ( [TT] ) it can be noted that the 
optimum measurement degree is a decreasing function of the sparsity order, s, as | log(l — s)| 
is an increasing function of s. This means that to guarantee that a measurement can verify the 
maximum number of variable nodes form a very sparse signal, the measurement degree should 
be very large. Moreover, a binary signal with a higher s require a lower measurement degree, as 
the number of nonzero variable nodes is relatively high; thus, each measurement is connected to 
a relatively large number of nonzero variable nodes. Fig. shows the optimized measurement 
degrees for different sparsity orders and different values of T. 


C. The number of measurements 

In the proposed reconstruction algorithm, we can verify a large number of variable nodes, 
including zero and nonzero signal components, from each measurement because a majority of 
the measurements are connected to a very small number of non-zero elements of the sparse 
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Fig. 5. The error rate versus the sampling rate for the sparse signal of length n = 1000, and the sparsity order s = 0.1. 

signal. The following lemma gives an upper bound on the number of measurements required to 
reeover the sparse signal by using the proposed reconstruetion algorithm (see Appendix for 
the proof). 

Lemma 3: We consider the binary AFCS, where the measurements are generated by the AFCS 
process with the measurement degree Lopt calculated in ( [TT] ). Let m denote the minimum number 
of measurements required for the signal of length n and sparsity order s, to be completely 
reconstructed by the sum verification decoder. Then, m is of O (—nlog(l — s)). 

It can be easily proven that the propose scheme requires fewer measurements compared to the 
existing binary CS approach ll^ . which the number of required measurements is of 0{k log(n)). 
This is because \og{x)/x is a decreasing function of x when t > e; thus, —n log(l— s) < k log(n) 
for k <n — e. 

D. Performance Evaluation of noiseless AFCS 

Fig. 1^ shows the error rate versus the sampling ratio for the proposed AFCS scheme, the 
-minimization approach ifTTI . and the binary CS approach in ll2Tll . when the signal length is 
n = 1000 and the sparsity order is s = 0.1. Here, the error rate is defined as the ratio of the 
number of unrecovered signal components and the signal length n. It is important to note that the 
decoding process in [1211 is similar to the sum verification decoding algorithm with T = 1. As can 
be seen in this figure, at sampling ratio of 0.15, the AFCS scheme with L = 20 achieves the error 
rate of 10“^; however, the error rate of the binary CS approach in ETl with the measurement 
degree L = 30 is in the order of 10“^, and the error rate in binary -minimization (ITTl is in 
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Fig. 6. The minimum sampling ratio required to successfully reconstruct a binary sparse signal of length n = 1000 versus the 
sparsity order. 

the order of 10“^. In other words, the AFC seheme with an even smaller measurement degree 
outperforms the existing binary CS sehemes. The reason is that in -minimization approaeh 
and binary CS scheme in [|2T]I . the variable nodes are selected uniformly at random to generate 
each measurement; thus, even with a large number of measurements some variable nodes may 
still not be selected by any measurement, leading to a higher error rate. However, in AFCS, 
the variable node degrees are uniform so the probability that a variable node is not selected by 
any check node is zero. Thus, AFCS can approach very small error probabilities even with a 
smaller measurement degree and decoding parameter T. Furthermore, as can be seen in Fig. 
the proposed AFCS approach with L = 25 has a lower error rate compared to the AFCS scheme 
with L = 20 and L = 30, due to the fact that the optimum measurement degree for sparsity 
order s = 0.1 and T = 1 is about L = 25; thus, with measurement degrees higher or lower than 
L = 25, the error performance will be degraded. 

Fig. shows the number of measurements required to successfully recover the binary sparse 
signal versus the sparsity order. As can be seen in Fig. the number of measurements for the 
proposed AFCS scheme is significantly lower than that for the binary CS scheme in jlTl. In 
fact, the number of measurements for the binary CS scheme in ll2Tll is of 0{k\og{n)) and that 
for the proposed AFCS scheme is of Cl(—nlog(l — |)). 

It is important to note that in the proposed reconstruction algorithm, the maximum value of T 

fL\ 

is considered to be 2. Thus, for each measurement the total number ofl + L+lj comparisons 
should be performed to recover the connected variable nodes. As a result, the total number of 
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comparisons is of 0{L?‘). As we have shown that the number of required measurements is 
of C>(— ?7,log(l — s)) and the optimum measurement degree is Lopt ~ — l/log(l — s), then the 
reeonstruetion eomplexity of the proposed algorithm is of 0{—n/ log(l — s)). This indieates that 
the eomplexity of the proposed algorithm inereases when the number of nonzero elements of 
the sparse signal is very small. In faet when s goes to zero, —1/ log(l — s) ean be approximated 
by 1/s, so the eomplexity of the deeoder will be of 0 {n^/k). 

IV. Noisy Binary Compressive Sensing 

In this seetion, we eonsider the problem of binary eompressive sensing in the presenee of 
additive white Gaussian noise (AWGN). In the presenee of the noise, AFCS measurements are 
given by 

y = Gb + z, (12) 

where z is an m by I veetor of zero mean additive white Gaussian noise with varianee a^Im 
and Im is the m x m identity matrix. In [l22l . we proposed a belief propagation (BP) deeoder 
for analog fountain nodes, whieh is aetually a modifieation of the deeoding algorithm originally 
proposed for sparse signal reeonstruetion in [|^ . Here, we use the same deeoder to reeonstruet 
the binary sparse signal in the presenee of additive white Gaussian noise. In eaeh iteration of the 
BP deeoder, messages (eonditional probabilities) are passed from eheek to variable nodes and 
then from variable to eheek nodes. Let qfi{0) and gjf (1) denote the messages that are passed 
from variable node j to eheek node i in the iteration of the message passing algorithm. 
Similarly, the messages passed from eheek node i to variable node j in the iteration are 
denoted by m-^^(O) and Message mfj{n) is ealeulated as follows for r G {0,1}: 

= p{bj = r\yi) = p( ^ bfgij^ = Vi - 
j'&N(i)\j 

where Af{i)\j is the set of all neighbors of eheek node i exeept variable node j. Also, message 
qfi{r) is ealeulated as follows. 

(ifhr) = Cji n ^ ^ {O’ 

where Ai{j)\i is the set of all neighbors of variable node j exeept eheek node i, and Cji is 
ehosen in a way that qfi{0) + gjP(l) = 1. After a predefined number of iterations, ealled F, the 
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Fig. 7. Minimum sampling ratio required to successfully recover a sparse binary signal of length n = 1000 versus the SNR, 
when the signal sparsity order is s = 0.1. 

final probabilities for variable nodes are given by: 

pf\r) = Cj JJ mSf(r), 

ieMij) 

where Cj is ehosen in a way that pj ^(0) +Pj '^(1) = 1. Variable node bj is then deeided to be 
0 if > p^^\l); otherwise, it is set to 1. 

Fig. 1^ shows the minimum sampling ratio (/3) required for the sueeessful reeonstruetion 
of all fc-sparse binary signals (k = 100) of length n = 1000 versus the SNR. As ean be 
seen in this figure, by increasing the measurement degree L the performance of the AFCS 
scheme is improved. More specifically, when SNR = 30 dB, the AFCS scheme with L = 12 
requires 160 measurements, while it requires 200 and 240 measurements when L = 10 and 
L = 8, respectively. Note that the reconstruction algorithm proposed in [|^ only works in the 
noiseless settings. So, we only compare the proposed AFCS scheme with the -minimization 
approach liTTll for the noisy case. As can be seen in Fig. the AFCS scheme with even a lower 
measurement degree always outperforms the f'l-minimization approach across all SNR values. 
The minimum sampling ratio for the proposed AFCS approach with non-uniform measurement 
matrix, where the variable node degree is not fixed, is also shown in Fig. As can be seen in 
this figure, the proposed AFCS approach with non-uniform variable node degrees and a very 
smaller measurement degree (L = 12) performs very close to the binary ii scheme with a very 
large degree (L = 30). This shows the superiority of the proposed scheme over conventional 
binary CS scheme for different measurement matrices and SNR values. 
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V. Sparse Event Detection in WSNs using AFCS 


Recently, CS theory has been widely used in wireless sensor networks [l33l |34l |35l |36l |371 
and shown a great performance improvement in terms of network lifetime, energy efficiency and 
overall system throughput. Here, we consider the sparse event detection problem in WSNs, where 
a set of monitoring sensors try to capture a small number of active event sources amongst a 
large number of event resources, randomly placed inside the monitoring region. More specifically, 
we show how the proposed AFCS approach can be effectively used in WSNs for sparse event 
detection. 

A. Problem Definition 

We consider n event sources, Ei, E 2 ,..., E„, randomly placed in a monitoring region S with 
a total area of S as depicted in Fig. Each event source Ej randomly generates an event e* to 
be measured by the sensor nodes. Here, an event is defined by a binary signal, which is either 0 
or 1 (or a constant value). We refer to those events which have the value of 1 as active events, 
where the number of simultaneously active events, denoted by k, is much smaller than n, i.e., 
k <^n ll3^ . Eet e^xi denote the event vector, where each of its elements, e* has a binary value, 
i.e., ej e {0,1}. Then, e is a /c-sparse binary signal of dimension n, i.e., ^i = k <^n. 

The event sources are captured by m active monitoring sensors, which are deployed in the 
monitoring region S. The sensing range of each sensor is assumed to be Rs, which indicates 
that a sensor can receive the signal from event sources within a distance of at most Rg. In 
this paper, we consider two deployment scenarios. In the first scenario, referred to as random 
deployment, sensors are randomly deployed in the monitoring region. In this approach, some 
event sources may not be covered by any sensors. As a result, these events cannot be detected. In 
the second scenario, referred to as uniform deployment, sensors are uniformly distributed inside 
the monitoring region to fully cover the whole monitoring area. Eet Rj denote the channel 
between the event source and the sensor node, then the received signal yj at the 
sensor node can be written as follows: 



(13) 


where Sj is the set of event sources inside the sensing range of the sensor node, and ej is 
the thermal measurement noise at the sensor node. The received signal vector y^xi then 
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Fig. 8. m active sensors are capturing k out of n active events. 

be written as follows: 


ymxl H^xnCnxl ^mxli (14) 

where H = and e = [e*]!<,<„. 

The sink node is assumed to have full knowledge of the channel matrix H and all sensors’ 
readings are sent to the sink node via a wireless channel at SNR 7 iSll. More specifically, 
we assume the whole network is fully connected, which means that the sensor nodes can 
communicate with each other in the local communication range and each sensor node can 
correctly receive its neighbor’s message ifT^ . Generally, we assume that sensors’ readings are 
relayed by the sensor nodes towards the sink node and they will be received at the sink node at 
SNR 7. The received signal at the sink node can then be shown as follows: 


cl = y 


mx 1 


^mxl) 


(15) 


where z is the m x 1 additive white Gaussian noise (AWGN) vector with zero mean and variance 
cr^, and 7 = ||He|p/(T^. This model has been widely used for sparse event detection in WSNs 


1. We will further discuss about the practical issues of this model in Section VI and 
propose some approach for its practical implementation. 


B. Sparse Event Detection in WSNs as an AFCS 

Since event sources generate binary signals, the event vector e can be considered as a vector 
of binary information symbols. As we assume that each sensor node receives signals only from 
active event sources inside the sensor’s coverage area, the number of nonzero entries in each 


row of H is small. Thus the sensing process in (151 is equivalent to the measurement process in 
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AFCS, where the ehannel gains are eonsidered as weight coefficients in the measurement matrix. 
More specifically, the sensor degree is defined as the number of event sources inside the coverage 
area of a sensor node, determining the number of nonzero elements in the respective row of the 
measurement matrix. We further define the event degree as the number of sensor nodes which 
have covered that event source. The following lemma gives the probability distribution of the 
sensor degree in the uniform deployment scenario and gives a lower bound on the minimum 
number of sensor nodes required for having a minimum event degree of at least 1. 

Lemma 4: Let n denote the number of event sources in the monitoring region S, and let m 
denote the number of sensor nodes with coverage radius Rs uniformly placed inside S to detect 
active event sources. Let ^ denote the probability that the sensor degree is d. Then, is 
given by: 


P P 

^u,d = aif{n, d, P) + a 2 f{n, d, —) + a^f{n, d, —), 


(16) 


where P = tiRI 


l/S, f{n,d,P) = —= (m — 4 — 4(y^ — 2))/m, a 2 = 4/m, 

and = {m — A{y/rri — 2))/(m). Moreover, to guarantee that each event source is covered by 
at least one sensor node in the uniform deployment scenario, the following condition should be 
satisfied: 






m> r7^r+ L;^J+l 


2P 


2P., 


(17) 


where [.] and [.J are the ceil and floor operators, respectively. 

The proof of this lemma is provided in the Appendix In the random deployment scenario, 
where both event sources and sensor nodes are randomly placed inside the sensing field, the 
sensor degree distribution Vtr^ and event degree distribution Kr,d, defined as the probability mass 
function of the sensor degree and that of event degree, respectively, are given by: 

P\i - PT-\ K,d = P\i - PT-\ 

where P = tiP^/S. This arises from the fact that the probability that an event source is located 
inside the coverage area of a sensor node is the ratio of the coverage area of the sensor and area 
of the monitoring region. Since the locations of sensor nodes and event sources are independent, 
the number of event sources inside the coverage area of a sensor node is a binomial random 




23 



Fig. 9. The degree distribution of the sensor and event sources for a WSN consisting of m = 64 sensors with coverage radius 
Ra — 20 m to capture n = 256 event sources within a rectangular field of dimension 200 m by 200 m. Solid lines show 
simulation results and dashed lines show analytical ones. 


variable. Similarly, we can show that the event degree follows a binomial distribution with 
parameter m and success probability P. 

It is important to note that in the random deployment scenario, the probability that an event 
source is not covered by any sensor node is = (1 — P)™. It is then straightforward to show 
that in the random deployment scenario, to guarantee that each event source is covered by at 
least one sensor node with the probability of at least 1 — C > Aq < C> the number of sensor 
nodes should satisfy the following condition: 


m > 


ln(C) 


(18) 


ln(l - P) ’ 

where C > 0 is a given real number. Fig. shows the sensor and event degree distribution in 
a WSN for both random and uniform deployment scenarios, where simulation and analytical 
results are very close. As can be seen in this figure, in the random deployment scenario the 
probability that an event source is not covered by any sensor node is relatively large (about 
0.15); thus, those events cannot be detected during the event detection process. However, in 
the uniform deployment scenario, as long as condition ( [T7] ) is satisfied, each variable node is 
covered by at least one sensor node; therefore, the active events can be detected with a higher 
probability compared to the random deployment scenario. 
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C. Analysis of the False Detection Probability 

As previously stated, the objeetive of the deeoding algorithm is to find the eonditional proba¬ 
bilities of the variable nodes for a given measurement matrix and the reeeived signals. Sinee we 
consider the additive white Gaussian noise with variance af the conditional probabilities can 
be written as follows: 


p(e|x, H) = p(z = X — He|H, x) 




TTCrf 


JJexp 

i=i 


2al 


and the decoding objective is to find e in a way that p(e|x, H) or equivalently log (p(e|x, H)) is 
maximized. Then, we have [|22l: 


e = argmin | |x — He| I2 

e 


(19) 


The following lemma gives an upper bound on the probability of false detection (PFD), defined 
as the probability that a non-active event is falsely detected as active. 

Lemma 5: Let pe denote the probability of false detection in a WSN consisting of m sensor 
nodes with coverage radius Rg, which are placed in a monitoring region of area S. Then, p^ is 
upper bounded as follows for a given noise variance 


Pe<Q { —\JmP{l - P)R] 


( 20 ) 


where P = ttR^/S and Q{x) = e "2 dx. 


The proof of this lemma is provided in Appendix It is important to note that from (201 
we can conclude that the probability of false detection in our proposed approach is very small. 
As an example, for a WSN consisting of m = 50 sensors randomly placed in 500 m by 500 m 
monitoring area with coverage radius Pg = 50 m, the probability of false detection is about 
10-5 at 7 = 20 dB. 


VI. Practical Considerations in the Sparse Event Detection in WSNs 

In this section, we address some practical issues in the system model, which have been 
commonly used for sparse event detection in WSNs. We modify the system model and consider 
more practical system setup to overcome these practical implementation issues. More specifically. 
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we show that with the new system set up the problem is the same as the original problem defined 
in (14) and ( [T5| ), but with more praetieal assumptions. 

Many existing sehemes [l39l l40l applied CS techniques for event detection in wireless sensor 
networks. However, all of them directly apply the conventional compressive sensing algorithms. 
For compressive sensing to be applicable, they assume that the possible locations at which events 
may occur are known and the number of active events is also available. Furthermore, they also 
make the impractical assumption that each node is able to monitor every possible location. This 
essentially means that any sensor node covers the whole monitoring area, which is not true 
in reality. Therefore, these existing schemes are inapplicable for detection and localization of 
real-world events that may occur anywhere in the monitoring area, and cover limited region due 
to quick signal attenuation. More importantly, the number of occurred events is changing over 
time and unknown in advance. 

Unlike the conventional event detection approach for WSNs, we do not assume anything about 
the number of active events or their locations. We only assume that the value of each event source 
is either 0 or 1, where an active event has a value of 1, otherwise it is 0. As the sensor nodes 
do not know the location of the event sources, we assume that the monitoring region has an 


area of S and is virtually divided into n = x Uy small grids as shown in Fig. 10 Thus, the 
location of an event can be approximated by the centroid of the grid containing the event. Let 
Sj denote the set of grids inside the coverage area of sensor node j, then the received signal at 
sensor node j can be shown as follows: 


= + ( 21 ) 

where €j is the thermal measurement noise at the sensor node, dij is the distance between 
the sensor node j and the center of grid i, a is the propagation loss factor, is a fixed signal 
gain factor of sensor nodes, and e, = 1 if an active event is located in grid i, otherwise it is 
zero. This model has been widely used in WSNs [|4Tl l42l l43l l44l and verified through practical 
experiments in BOl for the environment temperature. More specifically, the temperature 1 cm 
away from the heater was measured to be 60.0° C, which is set as the source signal strength. 
The environmental temperature of the day conducting the experiments was about 30.4° C. The 
experimental results in this paper have shown that the measured temperature at distance d to the 
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Fig. 10. Event detection in WSNs using the grid approach. Squares show sensor nodes and black circles show active event 
sources. 

heater follows the following model: 


t = 30.4 + 


60 


whieh has been verified by the real measurements. This model ean be further used when we 
assume that event sourees are randomly distributed in the sensing field and event sourees generate 
a signal of the same power Pq when they are aetive. The measured signal power at distanee d 
of eaeh event souree ean also be modeled by Pd = Po/d°', where a is the path loss exponent. 
This model ean be used in the environment that the wireless ehannel is modeled by path loss. 

Sensor nodes transmit their observations along with their IDs to the sink node in a time 
divided manner, in whieh the total transmission time is divided into m time slot and sensor j 
transmits its observation in the time slot. The ehannel between sensor node j and the sink 
node is known at the sink node whieh ean be simply obtained through the periodieal ehannel 
estimation at the sink node. Sinee the ehannel varies very slowly in sensor networks, the aeeurate 
ehannel state information ean be aequired by pilot symbols in WSN P31 . It is important to note 
that as the sink node knows the loeation of eaeh sensor node, it knows the grids whieh are 
inside the eoverage area of eaeh sensor node. Then, it ean ealeulate the distanee between the 
sensor nodes and the eentre of eaeh grid. We ean also assume that eaeh sensor node knows 
its loeation through embedded Global Positioning System (GPS) reeeivers or some loealization 
algorithms [|40l. Many other multi-hop transmission sehemes ean also be eonsidered to deliver 
the measurements to the sink node, whieh are eommonly used in WSNs. As these approaehes 
have been widely studied in the literature and are out of seope of this paper, we refer the 
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interested readers to POl and referenees therein for further details. 

We ean then use the same system model as in (14), where hij = and n is the number 

of grids. In faet, the sparse event deteetion ean be modeled as a binary eompressive sensing, 
where the number of measurements equals to the number of sensor nodes and the length of 
the binary sparse signal equals to the number of grids. It is then obvious that by inereasing the 
number of grids, eaeh aetive event falls inside one and only one grid. Sinee the number of aetive 
events is fixed (say k), by inereasing the number of grids n, the binary veetor e beeomes sparse. 

We also make the praetieal assumption that the sensor network is roughly time synehronized. 
Note that we do not require aeeurate time synehronization. Many existing algorithms have been 
developed for praetieal time synehronization in large seale sensor networks. For example, FTSP 
Boll ean aehieve eloek synehronization aeouraey as mueh as 2.24 /xs by exehanging a few bytes 
among neighbors every 15 minutes. Sinee the event deteetion period is normally tens of seeonds 
to tens of minutes. Thus, FTSP is suffieient for providing the needed time synehronization 
aeeuraey ll^ . 

It is very important to note that we assume that event sourees are randomly distributed in 
the sensing field and only a small number of them are aetive. We also assume that eaeh event 
souree is aetive independently of other event sourees. However, the measured signals at sensor 
nodes are eorrelated, as eaeh sensor node has a sensing range whieh eovers a small area of the 
sensing field whieh ean overlap with other sensors’ eoverage areas. As we assume that the signal 
of interest is binary, i.e., eaeh event souree generates a signal with the known power when it 
is aetive, this eorrelation ean be eaptured in the respeetive bipartite graph at the sink node. In 
faet, as the sink node knows the loeation of sensor nodes, whieh is a valid assumption and ean 
be obtained through GPS data [[40ll . it also knows the eoverage area of them, thus it ean build 
a bipartite graph representing the eonneetion between sensor nodes and event sourees. If we 
assume that sensor nodes and the eentroid of grids are shown respeetively by eheek and variable 
nodes, then a variable node is eonneeted to a eheek node if and only if the respeetive grid is 
loeated in the eoverage area of the respeetive sensor node. In this way, sensor nodes whieh are 
eloser to eaeh other have more overlapped area, thus they have more eommon variable nodes 
(they are eonneeted to a larger number of eommon variable nodes) in the bipartite graph. In 
other words, the eorrelation between sensing data ean be well presented in the bipartite graph 
and has been already eonsidered in the BP deeoding and the respeetive analysis. 
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Fig. 11. Probability of correct detection versus the signal sparsity order and sampling ratio, when Rs = 50m and n = 256 
event sources are randomly placed inside a 500 m by 500 m sensing field. 

VII. Simulation Results 

In this section, we investigate the performance of the proposed AFCS scheme for sparse 
event detection in WSNs. More specifically, we consider two cases. First, a noiseless case is 
considered, where the sensor readings are noiseless, so the proposed sum verification decoder 
can be effectively used for the signal reconstruction from noiseless measurements. Second, we 
consider a general noisy case and simulate the performance of the AFCS scheme for different 
SNRs and signal sparsity orders, where the belief propagation decoder is used for the signal 
reconstruction. Here, the sampling ratio, (3, is defined as the ratio of the number of sensors, m, 
and that of event sources, n. We use the probability of correct detection (PCD) as the performance 
metric, where PCD is defined as the percentage of correctly detected active events [|^ . 

Fig. shows the probability of correct detection as a function of signal sparsity order and 
sampling ratio, when Rg = 50 m and the sum verification decoder is used with parameter T = 2. 
As can be seen in this figure, the uniform deployment requires a less number of sensors to be 
deployed in the sensing field compared to the random deployment. This is because that in the 
random deployment scenario, the probability that an event source is not covered by any sensor 
node is relatively high even when the number of sensor nodes is large. Moreover, as shown in 
Fig. when the sparsity order increases, the sum verification decoder requires more sensors’ 
readings to detect all active events. The reason behind this is that when the number of active 
events increases, the number of active events connected to each sensor node also increases, 
thus the sum verification decoder cannot detect them. Moreover, when the sensors are randomly 
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Fig. 12. Probability of correct detection versus the sensor coverage radius and sampling ratio, when only fc = 10 out of 
n = 256 event sources are simultaneously active. Event sources are randomly placed inside a 500 m by 500 m sensing filed. 

distributed, the degree of eaeh event souree follows the Poisson distribution, whieh results in a 
non-uniform error proteetion of the event sourees. In other words, a fraetion of event sourees are 
covered by a large number of sensors, but the rest of event sources may be covered by a very 
small number of sensors; thus, they have a lower chance to be recovered from the measurements. 
It is important to note that the probability of false detection (PFD) is zero for the proposed sum 
verification decoder as it only detects the active events and other events are always detected as 
zero. 

Fig. shows the PCD versus Rg and the sampling ratio for the case that n = 256 events 
sources are randomly distributed within a 500 m by 500 m sensing field, where only k = 10 
event sources are active. As can be seen in this figure, the proposed sum verification decoder 
can completely detect the active events in the uniform deployment scenario even at very small 
sampling ratios when the coverage radius of sensors is larger than 40m. In fact, by increasing 
the sensing coverage radius, active event sources are covered by more sensor nodes and thus 
they can be detected with a higher probability. However, as can be seen in this figure, in the 
random deployment scenario active event sources cannot be complectly detected even with a 
large sensing coverage radius or high sampling ratio. 


Fig. 13 shows the minimum sampling ratio required to successfully detect all active events 


versus the signal sparsity order s and Rg for the uniform deployment scenario. As can be seen 
in this figure, with increasing s, a larger number of sensors have to be deployed in the sensing 
field. This is because of the fact that a larger number of active event sources are connected to 
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Fig. 13. Minimum sampling ratio (/3) required to successfully detect active events amongst n = 256 event sources randomly 
distributed within an area of 500 m by 500 m versus Ra and s. The sensor nodes are uniformly placed inside the sensing field. 

each sensor nodes; thus, the proposed sum verification decoder with T = 2 requires a larger 
number of measurements for sparse event detection. Moreover, when Rg increases, a smaller 
number of sensor nodes is required to be deployed in the sensing field to successfully recover 
all active events. However, when the signal sparsity order increases, a lower number of sensor 
nodes is required when the sensor coverage radius is low compared to the case that the sensing 
coverage radius is high. 

We now compare the performance of AFCS with Bayesian compressive sensing (BCS) li25ll 


shows PCD versus the sampling ratio at different SNR values, when n = 1000 event sources 
are randomly distributed in an area of 1000 m by 1000 m, k = 10, and a = 3. As can be 
seen in this figure, the proposed AFCS scheme outperforms both BCS [l25]l and SCS ll2^ in 
terms of PCD for different SNRs. Moreover, the number of required sensor nodes in SCS is 
relatively large (500), which significantly increases the total number of transmissions, leading 
to poor energy efficiency. It is important to note that the existing approaches need to know the 
number of active events to achieve a relatively good performance in terms of PCD; however, 
the proposed AFCS approach does not need to know the sparsity level of the events and can 
work for various numbers of active events. Moreover, the achievable PCD for the AFCS scheme 
with random deployment is very close to that for the SCS approach Il26l . and so we removed 
the respective curve from the figure for a clearer representation. 


and sequential compressive sensing (SCS) [|26ll at different SNRs and sampling ratios. Fig. 14 
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Fig. shows the minimum sampling ratio versus SNR in order to aehieve a PCD larger 
than or equal to 99%, when a = 3 and n = 256 event sourees are randomly distributed within 
500 m by 500 m area. As ean be seen in this figure, in the uniform and random deployment 
seenarios, the minimum number of nodes approaehes the lower bounds in ( [T7] ) and (18), when 
SNR inereases. Moreover, in low SNRs, the proposed approaeh ean deteet the aetive events with 
a large number of sensor nodes, where a WSN with sensors with a high eoverage radius is more 
robust to the noise effeet in terms of the sampling ratio. Note that in the random deployment 
seenario, more sensor nodes are required as some event sourees may not be eovered by any 
sensor nodes, leading to a lower pereentage of eorreet deteetion. 


VIII. Conclusion 

In this paper, we proposed an analog fountain eompressive sensing (AFCS) approaeh for sparse 
reeovery of the binary sparse signal. In the proposed approaeh, eaeh measurement is generated 
from the binary sparse signal in a way that the variable node degrees are almost the same. We 
proposed a generalized verifieation based reeonstruetion algorithm, whieh enables a measurement 
eonneeted to at most T nonzero signal elements to reeover their eonneeted variable nodes. The 
measurement degree was also optimized to maximize the number of the verified variable nodes 
by eaeh measurement. We showed that the number of measurements required for the sueeessful 
reeovery of the sparse signal is of C>(— ?7,log(l — s)), where n is the signal length and s is the 
sparsity order. Simulation results showed that the AFCS seheme signifieantly outperforms the 
eonventional binary CS and £i-minimization approaehes in terms of the error rate for different 
sparsity orders. We further showed that the sparse event deteetion problem in wireless sensor 
networks ean be represented by AFCS. Simulation results showed that the proposed AFCS 
approaeh outperforms the existing sparse event deteetion sehemes in WSNs in terms of the 
probability of eorreet deteetion in a wide range of signal to noise ratios with a mueh smaller 
number of sensor nodes and a negligible probability of false deteetion. 


Appendix A 
Proof of Lemma [I] 

Let and denote the probability that a eheek node ean verify its eonneeted variable 
node as zero and nonzero variable node, respeetively, after i iterations of the deeoding algorithm. 
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Fig. 14. Probability of correct detection (PCD) versus the sampling ratio (/3) at different SNRs, when n = 1000 event sources 
are randomly distributed in an area of 1000 m by 1000 m and fc = 10 events are simultaneously active. 



Fig. 15. Minimum sampling ratio for the proposed AFCS scheme to achieve a PCD higher than 99%, when n = 256 event 
sources randomly distributed with the 500m-by-500m area and only k = A events are active. Dashed line show the lower bound 
on the minimum number of sensors obtained from GU and 

Let us define /* = + f-^\ which denotes the probability that a check node can verify the 

connected variable nodes in the iteration of the sum verification decoding. It is clear that a 
SUM node at depth 2i — 1 can verify its parent at depth 2(1 — 1) as zero variable node, if one 
of the following events happened. First, all of its connected OR nodes at depth 21 have been 
verified; second, at least one of its connected OR nodes at depth 21 has not been verified, but 
the SUM node can verify the remaining variable nodes. The second event happens if the value 
of the check node is equal to the sum of at most T edges connected to the unverified variable 
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nodes. Thus, can be written as follows: 

d A 


= EE -P.-.)j Af_„ (22) 

which arises from the fact that i out of d variable nodes connected to the check node are not 
verified with probability —pi-iY~\ and j out of d — i unverified variable nodes have 

potentially nonaerc values with probability ^ ') (,f> Sinee the probability that 

an unverified variable node at depth 2{i — 1) being zero is and i and j vary from 0 to d 
and 0 to ydi = min{T — 1, d — i}, respectively, then (22) is straightforward. Similarly, can 
be calculated as follows: 


d\ fd-i\ i / (0) . J 


d A 2 


/i‘’ = EE 


3 C J 0 (1 - Pi-1)) K-l 1 


(23) 

\ t / \ 1 / ‘V"‘ / 

i=0 j=0 

where A 2 = min{T — l,d — i}. As we consider that /j = then (6) can be directly 

obtained by the summation of ([22|) and ([23j). Moreover, if a variable node receives at least one 


message of value 1 form its connected check nodes, then it can be verified. Let us assume that an 
OR node has j children with probability 6j, so it can be verified with probability 1 — dj(l — /j)L 
Then, pt can be verified by summation over j as follows: 


p, = 5^(1-0,(1-/.:)') = 1-^(1-/,). 


(24) 


As stated before, after i iterations of the verification-based decoding algorithm, a variable node 
can be verified as zero variable node with probability This means that the average number 
of zero valued unverified variable nodes is A: — Since the average number of unverified 

variable nodes in the A iterations is n — npi, then can be calculated as follows: 

(1) ^ _ 1 - go - U 

Vi 


n-Pi 


Pi 


(25) 


This completes the proof. 


Appendix B 
Proof of Lemma [2] 

For simplicity, we assume L = d + 1, then by using the fact that 


can be approximated 


by L^il for L being very large compared to i, the average number of recovered variable nodes 
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by each measurement, denoted by R, can be approximated as follows for a small T: 

T 


ji+l 

L • 


i=0 


(26) 


and the derivative of R with respect to L can be calculated as follows: 

T 


^ (i?) = ^ {{i + 1) + log(go)), 


dL 


(27) 


2=1 


where A = (1 — go)/?o- The solution for the equation dR/dL = 0 gives the optimal value of L. 
Therefore, we need to solve the following equation: 


A 


y^^L‘((i + l) + Llog(*))=0. 


(28) 


2=1 


The term inside the summation is zero, when L = _ , which gives us an approximation of 

the optimum L. A better approximation can be achieved by taking the average of ^ over 


■log(9o) 


i, which directly results in (11), due to the fact that i = (T + 1)(T + 2)/2. 


Appendix C 
Proof of Lemma [3] 

It is clear that each measurement can verify at most Lopt variable nodes; thus, the destination 
requires at least n/Lopt measurements to determine all variable nodes. Moreover, each mea¬ 
surement can verify its connected variable nodes with probability P{Lopt), and accordingly the 
average number of measurements that can recover their variable nodes is mP{Lopt)- However, 
after the first iteration of the verification-based decoding algorithm, and removing the edges 
connected to the verified variable nodes, a new set of measurements can recover their variable 
nodes in the next iteration of the reconstruction algorithm and so, the verification process is 
continued. This means that the actual number of required measurements is less than n/mP{Lopt)- 
Therefore, we have: 


n 

Lopt 

where 

T 

p{Lopt) = ^ (1 _ s)^°p‘-JsL 

j=o V 7 / 


< m < 


n 


L optPi^L opt) 


(29) 


( 30 ) 
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As can been seen in ( [^ , p{Lopt) is an increasing function of T. Thus, we have 

PiLopt) > = (1 - 

j=o ^ / 

and by replacing ( [TT] ) in the above equation for T = 0, we have 

P{Lopt) > (1 - = e"\ 


Therefore, (29) can be rewritten as follows: 


n ne 

-— < m < -— 


■^opt 


■^opt 


and by replacing ([TT]) in (j^, we have: 


—2n log(l — s) —2ne log(l — s) 

- < m < - 


T + 2 


T + 2 


( 31 ) 


(32) 


(33) 


(34) 


This completes the proof. 


Appendix D 
Proof of Lemma |4] 

As we assume that event sources are randomly placed in the sensing field S, the probability 
that an event source is located inside the coverage area of a sensor node with a total coverage 
area of will be P = ttRI/S. Moreover, as event sources are independently distributed in 
the monitoring field, the probability that d out of n event sources exist in the coverage area of 
a sensor node will be f{n,d,P). However, as shown in Fig. four sensor nodes, which are 
located at the edges, will have a total coverage area of ttP^/T inside S and A{y/m — 2) sensor 
nodes have coverage area of ttRI/2 inside S. Thus, according to the law of total probability, the 
probability that d out of n event sources are located inside the coverage area of a sensor node 
is given by (|T^. 


As can be clearly seen in Fig. 16, the maximum distance between two neighboring sensor 
nodes is 2Rs; thus, the minimum number of sensor nodes required to cover the sensing field in 
a way that sensors coverage areas do not intersect, is [^1^- However, to cover the rest of the 


sensing field, we need at least (^[^J + ij sensor nodes. Therefore, the minimum number of 
sensor nodes to fully cover the sensing field is given by ([T7|). 
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Appendix E 
Proof of Lemma [5] 

Let us consider that signal e' has been decoded, with i false detections. Without loss of 
generality, we consider that e and e' are different in the first t places, and let pi denote the 
probability of this event, then we have. 


pe = p{\\x~lle'\\l < ||x-He||2) 


=p 1I I 

i=l 




i=l 


i=i 
2 


= p 11 -Y1 




i=l 


i=i 


(b) 

= P 




,j=i 


2=1 


j = l 


m / n 


= P i(e' - Ci) , 


j=i \ i=i 


j=i i=i 


where step (a) follows the fact that x = He + z, and step (b) follows the fact that only first i 
places of e and e' are different. As noise components Wj are i.i.d. Gaussian random variables 
with variance a^, then Yl]Li Yl\=i ‘^Zjhj,i{e[ — ei) also has a zero mean Gaussian random variable 
with variance Aal YlT=i {Y11 =i hj,i{ei — e')) . Thus, p£ can be calculated as follows: 


(35) 




1 

\ i=l \i=l 


Let Ci = Cj —e', then q = — 1 when a false detection occurs. It is then clear that pi is a decreasing 
function of i; thus, the probability of false detection, pe, which is a linear summation of piS is 
always smaller or equal to pi, i.e., Pe < Pi- Thus, we have: 

1 




i=i 


The event degree follows a binomial distribution of parameter m and success probability P = 

-- I I 

7iR‘^/S with average event degree of mP(l—P), and IE[/r|i] < Ps thus, (20) is straightforward. 
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Fig. 16. Uniform deployment of sensor nodes in the sensing field. 
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